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Transmembrane domain: 
N-glycosylation site: 



Amino acids 51-7 0 
Amino acids 224-228 



cAMP- and cGMP- dependent protein kinase phosphorylation sites: 

Amino acids 46-50 ; 118-122 



N-myristoylation sites: 



Amino acids 50-56 ; 129-135 ; 341-347 ; 
357-363 



Fibrinogen beta and gamma chains C-terminal domain signature: 

Amino acids 3 96-409 

(57) Abstract- The invention concerns compositions and methods for the diagnosis and treatment of neoplastic cell ^owth and 
SfcaSn mammals, tnc.ud.ng human, The invention is based upon the identification of genes that are amphhed . the genom 
of tumor cells Such gene amplification is expected to be associated w,th the overexpress.on ot the gene product as compared to 
n Z e is of the same ts" P Ue and contribute to tumorigenesi, Accordingly, the proteins encoded by ft. amphfied genes are 
behTed to be useful targets for the diagnosis and/or treatment (including prevention) of certain cancers and may act as predtctors 
5 Sgnosis tumor tiatment. The present invention is directed to novel polypeptides and to nucle,c ac,d molecules encoding 
Lse p'olypeptide, Also prov.ded heretn.e vectors and host cells comprising those nucleic acid sequences, chrmenc po ypepnde 
moleculefcompn-g the'polypeptides of the present invention fused to heteno logons po ypeptt e wh.ch b,nd 

to the polypeptides of the present invention and to methods for producmg the polypeptides of the present mvention. 
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of Washington, Seattle, Washington). One or more of the ESTs used in the assembly was derived from a library 
constructed from tissue obtained from the parotid (salivary) gland of a human with parotid cancer. The consensus 
sequence obtained therefrom is herein designated as DNA56019. 

In light of an observed sequence homology between the DNA56019 sequence and Incyte EST clone no. 
5 1327836, Incyte EST clone no. 1327836 was purchased and the cDNA insert was obtained and sequenced. The 

sequenceofthisc^NA^^^ 

The entire coding sequence of DNA64884-1527 is included in Figure 27 (SEQ ID NO:27). Clone 
DNA64884-1527 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 79-81 and ending at the stop codon at nucleotide positions 391-393 (Figure 27). The predicted 
10 polypeptide precursor is 104 amino acids long (Figure 28; SEQ ID NO:28). The full-length PRG1245 protein 
shown in Figure 28 has an estimated molecular weight of about 10,100 daltons and a pi of about 8.76. Analysts 
of the full-length PR01245 sequence shown inFigure 28 (SEQ ID NO:28) evidences the presence of a vanety of ■ 
important polypeptide domains, wherein the locations given for those important polypeptide domams are 
approximate as described above. Analysis of the full-length PR01245 sequence shown inFigure 28 evrdences the 

15 preS en«ofthefoUow^ 

from about amino acid 8 to about amino acid 14, from about amino acid 65 to about amino acid 71, from about 
amino acid 74 to about amino acid 80, and from about amino acid 88 to about amino acid 94; and a prokaryotrc 
membrane lipoprotein lipid attachment site from about amino acid 5 to about amino acid 16. Clone DNA64884- 
1527 has been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203155. 

20 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 28 (SEQ ID NO:28), evidenced some homology 
between the PR01245 amino acid sequence and the following Dayhoff sequences: SYA_THETH, GEN11167, 
MTV044_4, AB011151.1. RLAJ2750_3, SNELIPTRAJ, S63624, C28391, A37907, and S14064. 

EXAMPLE 17 

25 Tsolation of cDNA Clones F nr.orfmg Human PRO 1759 

DNA76531-1701 was identified by applying the proprietary s lg nal sequence finding algorithm described 
inE X ample2above.Useoftheabovedescribedsign a l sequence algorithm allowed identification of an EST cluster 
sequence from the LIFESEQ® database, Incyte Pharmaceuticals, Palo Alto, CA, designated DNA1 057 1 This EST 
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included pubhc 
30 EST databases (,,., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo 
Alto CA) to identify existing homologies. The homology search was performed using the computer program 
BLAST or BIAST2 ( - — ' - - ' Methods in F.nzvmol 0g v. 266:460-480(1996)). Those comparisons resulting 
in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
35 Seattle, Washington). One or more of the ESTs used in the assembly was derived from pooled eosinophds of 
allergic asthmatic patients. The consensus sequence obtained therefrom is herein designated as DNA57313. 

In light of an observed sequence homology between the DNA573 13 sequence and Incyte EST 2434255, 
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thecloneincludingthis^^ 

sequenceoftmscDNAinsertisshowninFigure 29 (SEQ ID NO :29) and is herein designated asDNA76,31-1701. 

Tta entire coding sequence of DNA76531-1701 is included in Figure 29 (SEQ ID NO:29). Clone 
DNA7653 1-1701 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 125-127 and ending at the stop codon at nucleotide positions 1475-1477 (Figure 29). The predicted 
polypeptide precursor is 450 amino acids long (Figure 30; SEQ ID NO:30). Tne full-length PR01759 protein 
shown in Bgure 30 has an estimated molecular weight of about 49,765 daltons and a pi of -about 8.14. Analysts 
of the full-length PR01759 sequence shown in Figure 30 (SEQ ID NO:30) evidences the presence of a vanety of 
important polypeptide domains, wherein the locations given for those important polypeptide domams are 

) app^onuaeasdescri^ 

presence of the following: a signal peptide from about amino acid 1 to about amino acid 18; transmembrane 
domains fromabout amino acid 41 to about amino acid 55, from about amino acid 75 to aboutamino acid 94, from 
about amino acid 127 to about amino acid 143, from about amino acid 191 to about amino acid 213, from about 
amino acid 249 to about amino acid 270, from about amino acid 278 to about amino acid 299, from about annuo 

5 add314toaboutaminoacid33.0,f^^ 

to about amino acid 394, and from about amino acid 410 to about amino acid430;acAMP-andcGMP-de P endent 

proteinMnasephosphorylauonsite^^^ 

about ammo acid 11 to about amino add 17, from about amino acid 18 to about amino acid 24, from about ammo 
acid 84 to about amino acid 90,*om about amino acid 92 to about amino acid 98, from about amino acid 137 to 
20 about amino acid 143, from about amino acid 138 to about amino acid 144, from about amino acid 238 to about 
amino acid 244, from about amino acid 253 to about amino acid 259, from about amino acid 278 to about ammo 
acid 284, and from about amino acid 282 to about amino acid 288; an amidation site from about ammo acd 102 
to about amino acid 106; and a prokaryotic membrane lipoprotein lipid attachment site from about amino actd 6 to 
about amino acid 17. Clone DNA7653M701 has been deposited with ATCC on November 17, 1998 and „ 

25 assigned ATCC deposit no. 203465. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 30 (SEQ ID NO.30). evidenced sequence identrty 
betweenmePR01759annnoacidsequenceand^^ 

S67684, RGT2.YEAST, S68362, ATSUGTRPR_1, P.W17836 (Patent application WG97 15668- A2), F69587, 
30 A48076, andA45611. 

F.XAMPLE 18 
T^i Q ri™ r^DNA Clone* Finding Human PRQ5775 
DNA96869-2673 was identified by applying the proprietary signal sequence finding algorithm described 
inE X ample2above. Use of the above described signal sequence algorithm allowed identification of an EST cluster 
35 sequence from the LIFESEQ® database, Incyte Pharmaceuticals, Palo Alto, CA, designated herein as CLU86443. 
Tins EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which 
included public EST databases (, g „ GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte 
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HCT116, SKCOl, and LS174T. 

Because amplification of DNA58801-1052 occurs in various tumors, it is highly probable to play a 
1 significant role in tumor formation or growth. As a result, antagonists (e.g. , antibodies) directed against the protein 
encoded by DNA58801-1052 (PR0779) would be expected to have utility in cancer therapy. 

5 PRQ1185 (DNA6288M515): 

The ACt values for DNA62881-1515 in a variety of tumors are reported in Table 7A. A AGt of >1 was 
typically used as the threshold value for amplification scoring, as this represents a doubling of gene copy. Table 
7A indicates that significant amplification of nucleic acid DNA62881-1515 encoding PROl 185 occurred: (1) in 
primary lung tumors: LT3, LT30 and LT26; and (2) in primary colon tumor CT2. 
10 Because amplification of DNA62881-1515 occurs in various tumors, it is highly probable to play a 

significant role in tumor formation or growth. As a result, antagonists (e.g. , antibodies) directed against the protein 
encoded by DNA6288 1-1515 (PROl 185) would be expected to have utility in cancer therapy. 

PRQ1245 (DNA64884-1527V 

The ACt values for DNA64884-1527 in a variety of tumors are reported in Table 7A. A ACt of >1 was 
15 typically used as the threshold value for amplification scoring, as this represents a doubling of gene copy. Table 
7A indicates that significant amplification of nucleic acid DNA64884-1527 encoding PRO 1245 occurred: (1) in 
primary lung tumors: LT13, LT15 and LT16; (2) in lung tumor cell line H522; and (3) in primary colon tumor 
CT15. 

Because amplification of DNA64884- 15 27 occurs in various tumors, it is highly probable to play a 
20 significant role in tumor formation or growth. As a result, antagonists (e.g. , antibodies) directed against the protein 
encoded by DNA64884-1527 (PR01245) would be expected to have utility in cancer therapy. 

PRQ1759 (DNA76531-170n: 

The ACt values for.DNA76531-1701 in a variety of tumors are reported in Table 7B. A ACt of >1 was 
typically used as the threshold value for amplification scoring, as this represents a doubling of gene copy. Table 
25 7B indicates that significant amplification of nucleic acid DNA76531-1701 encoding PR01759 occurred: (1) in 
primary lung tumors: HF-000840 and HF-001296; and (2) in primary colon tumor center HF-000795. 

Because amplification of DNA76531-1701occurs in various tumors, it is highly probable to play a 
significant role in tumor formation or growth. As a result, antagonists (e.g. , antibodies) directed against the protein 
encoded by DNA76531-1701 (PR01759) would be expected to have utility in cancer therapy. 

30 PRQ5775 (DNA96869-2673): 

The ACt values for DNA96869-2673 in a variety of tumors are reported in Table 7B. A ACt of >1 was 
typically used as the threshold value for amplification scoring, as this represents a doubling of gene copy. Table 
7B indicates that significant amplification of nucleic acid DNA96869-2673 encoding PR05775 occurred: (1) in 
primary lung tumors: HF-00063 1, HF-00064 1, HF-000643, HF-000840, HF-000842, HF-001293, HF-001294, HF- 
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